gi|22122697|ref|NP_666274.1| hypothetical protein MGC38046 ...
gi|20150316|pdb|1J4S|A Chain A, Structure Of Artocarpin: A ...
gi|26347287|dbj|BAC37292.1| unnamed protein product [Mus mu...
1| COG0642: Signal transduction...   27   48
gi|38086162|ref|XP_133316.2| similar to hypothetical protei...   27   48
gi|20808945|ref|NP_624116.1| hypothetical protein [Thermoan...   27   48
gi|6753710|ref|NP_034228.1| opsin (encephalopsin); encephal...   27   48
gi|45547389|ref|ZP_00187440.
PCM1 [Takifugu rubripes]
gi|47216258|emb|CAG05954.1| unnamed protein product [Tetrao...
gi|10719868|sp|Q9QYC0|ADDA_MOUSE Alpha adducin (Erythrocyte...
gi|27676688|ref|XP_218355.1|
gi|34873860|ref|XP_343971.1|
similar to Mox-1 [Rattus norve...
similar to RIKEN cDNA 1110002H...
unnamed protein product [Tetrao...
Add1 protein [Mus musculus]
unnamed protein product [Mus mu...
adducin 1 (alpha) [Mus musculus...
gi|461774|sp|P23805|CONG_BOVIN Conglutinin precursor >gi|34...
adducin 1, alpha; alpha-ADD [Ra...
mesenchyme homeobox 1 [Mus musc...
gi|50510951|dbj|BAD32461.1| mKIAA1482 protein [Mus musculus]
gi|12842851|dbj|BAB25758.1| unnamed protein product [Mus mu...
gi|34881239|ref|XP_228551.2| similar to CCG1 [Rattus norveg...
forkhead box D4-like 1 [Homo s...
predicted protein [Neurospora ...
AEL017Wp [Eremothecium gossypi...
FOXD4 [Gorilla gorilla]
hypothetical protein MGC75588 ...
similar to synaptic SAPAP-inte...   26   116
gi|50513963|pdb|1VB0|A Chain A, Crystal Structure Of Artoca...
somatostatin receptor-interacting...
SH3 and multiple ankyrin repea...
unnamed protein product [Mus mu...
Eph-like receptor tyrosine kinase...
hypothetical protein CNBM1020 [C...
putative hydrolase [Thermus th...
unnamed protein product [Mus mu...
Hypothetical protein HHV4gp016 ...
gi|4758284|ref|NP_004432.1| ephrin receptor EphB1 precursor...   25   156
similar to CG33130-PA [Rattus ...
gi|38089043|ref|XP_133997.4| RIKEN cDNA 2310057J16 [Mus mus...
hypothetical protein MG10317.4 [...
gi|21224968|ref|NP_630747.1| conserved hypothetical protein...
gi|4104411|gb|AAD02030.1| Eph-like receptor tyrosine kinase...
gi|49257790|gb|AAH74709.1| Unknown (protein for MGC:69445) ...
gi|34327988|dbj|BAB67820.2| KIAA1927 protein [Homo sapiens]
gi|29179544|gb|AAH48787.1| Similar to RIKEN cDNA 4930541M15 ...
hypothetical protein [Yersinia...
gi|12328471|dbj|BAB21131.1| P0416D03.19 [Oryza sativa (japo...

Alignments




>gi|626062|pir||JC2217 major surface glycoprotein 5 - Pneumocystis carinii
gi|425785|dbj|BAA04851.1| major surface glycoprotein [Pneumocystis carinii]
gi|743315|prf||2012229A major surface glycoprotein:ISOTYPE=MSG5
Length = 1076

Score = 31.6 bits (67), Expect = 1.9
Identities = 13/20 (65%), Positives = 13/20 (65%), Gaps = 3/20 (15%)

Query: 3   QGPAGSGWEEGSGSPPGVTP 22
           Q PAGS    GSGSPP V P
Sbjct: 816 QAPAGSS---GSGSPPAVPP 832

>gi|309818|gb|AAA33784.1| antigen
Length = 386

Score = 31.6 bits (67), Expect = 1.9
Identities = 13/20 (65%), Positives = 13/20 (65%), Gaps = 3/20 (15%)

Query: 3   QGPAGSGWEEGSGSPPGVTP 22
           Q PAGS    GSGSPP V P
Sbjct: 126 QAPAGSS---GSGSPPAVPP 142



>gi|22958582|ref|ZP_00006250.1| COG1360: Flagellar motor protein [Rhodobacter sp
Length = 366

Score = 29.1 bits (61), Expect = 11
Identities = 11/16 (68%), Positives = 13/16 (81%), Gaps = 2/16 (12%)

Query: 4   GPAGSGWEEGSGSPPG 19
           GPAG G  EG+G+PPG
Sbjct: 189 GPAGPG--EGTGAPPG 202



>gi|38075587|ref|XP_356708.1| similar to transcription factor Oct-3, long spli
[Mus musculus]
Length = 153

Score = 28.6 bits (60), Expect = 15
Identities = 11/17 (64%), Positives = 12/17 (70%), Gaps = 3/17 (17%)

Query: 2  MQ--GPAGSGWEEGSGS 16
          MQ  GP G+GWEE S S
Sbjct: 1  MQIGGP-GAGWEENSSS 16



>gi|23600471|gb|AAN39050.1| putative phosphate transporter OsPT9 [Oryza sativa (
cultivar-group)]
Length = 582

Score = 28.6 bits (60), Expect = 15
Identities = 12/18 (66%), Positives = 14/18 (77%), Gaps = 2/18 (11%)

Query: 5  PAGSGWEEGSGSPPGVTP 22
          P+GS   +GSGS PGVTP
Sbjct: 55 PSGSV--DGSGSGPGVTP 70



>gi|7341204|gb|AAF61213.1| NS protein [pig-tailed macaque parvovirus]
Length = 672

Score = 28.2 bits (59), Expect = 20
Identities = 13/20 (65%), Positives = 13/20 (65%), Gaps = 3/20 (15%)

Query: 10  WE--EGSGSPP-GVTPLFSP 26
           WE  E SGSPP   TPL SP
Sbjct: 542 WESSEDSGSPPRSSTPLASP 561



>gi|20150320|pdb|1J4T|A Chain A, Structure Of Artocarpin: A Lectin With Mannos
Specificity (Form 2)
gi|20150321|pdb|1J4T|B Chain B, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150322|pdb|1J4T|C Chain C, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150323|pdb|1J4T|D Chain D, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150324|pdb|1J4T|E Chain E, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150325|pdb|1J4T|F Chain F, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150326|pdb|1J4T|G Chain G, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150327|pdb|1J4T|H Chain H, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 2)
gi|20150328|pdb|1J4U|A Chain A, Structure Of Artocarpin Complexed With Me-Alpha
gi|20150329|pdb|1J4U|B Chain B, Structure Of Artocarpin Complexed With Me-Alpha
gi|20150330|pdb|1J4U|C Chain C, Structure Of Artocarpin Complexed With Me-Alpha
gi|20150331|pdb|1J4U|D Chain D, Structure Of Artocarpin Complexed With Me-Alpha
Length = 149

Score = 28.2 bits (59), Expect = 20
Identities = 8/11 (72%), Positives = 10/11 (90%)

Query: 4  GPAGSGWEEGS 14
          GP G+GW+EGS
Sbjct: 12 GPGGNGWDEGS 22



>gi|22122697|ref|NP_666274.1| hypothetical protein MGC38046 [Mus musculus]
gi|19343896|gb|AAH25600.1| Hypothetical protein MGC38046 [Mus musculus]
Length = 280

Score = 28.2 bits (59), Expect = 20
Identities = 16/28 (57%), Positives = 17/28 (60%), Gaps = 9/28 (32%)

Query: 6  AGSGWE-EGSGS------PPGVTPLFSP 26
          AGSG E EGS +      PPG TP FSP
Sbjct: 34 AGSG-EAEGSSASSPSLPPPG-TPAFSP 59



>gi|20150316|pdb|1J4S|A Chain A, Structure Of Artocarpin: A Lectin With Mannos
Specificity (Form 1)
gi|20150317|pdb|1J4S|B Chain B, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 1)
gi|20150318|pdb|1J4S|C Chain C, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 1)
gi|20150319|pdb|1J4S|D Chain D, Structure Of Artocarpin: A Lectin With Mannose
Specificity (Form 1)
Length = 149

Score = 28.2 bits (59), Expect = 20
Identities = 8/11 (72%), Positives = 10/11 (90%)

Query: 4  GPAGSGWEEGS 14
          GP G+GW+EGS
Sbjct: 12 GPGGNGWDEGS 22



>gi|26347287|dbj|BAC37292.1| unnamed protein product [Mus musculus]
Length = 280

Score = 28.2 bits (59), Expect = 20
Identities = 16/28 (57%), Positives = 17/28 (60%), Gaps = 9/28 (32%)

Query: 6  AGSGWE-EGSGS------PPGVTPLFSP 26
          AGSG E EGS +      PPG TP FSP
Sbjct: 34 AGSG-EAEGSSASSPSLPPPG-TPAFSP 59



>gi|50255367|gb|EAL18102.1| hypothetical protein CNBK1230 [Cryptococcus neoforma
neoformans B-3501A]
Length = 598

Score = 28.2 bits (59), Expect = 20
Identities = 13/29 (44%), Positives = 13/29 (44%), Gaps = 14/29 (48%)

Query: 1   GMQGPA---------GSGW-----EEGSG 15
           GMQGPA         G GW     EEG G
Sbjct: 518 GMQGPAGVMAGTGDVGMGWGGIQKEEGGG 546



>gi|34873522|ref|XP_220904.2| similar to hypothetical protein 4932418K24 [Ratt
Length = 747

Score = 27.8 bits (58), Expect = 27
Identities = 8/11 (72%), Positives = 10/11 (90%)

Query: 16  SPPGVTPLFSP 26
           SPPG +PLF+P
Sbjct: 112 SPPGASPLFTP 122



>gi|309820|gb|AAA33785.1| Antigen
Length = 417

Score = 27.8 bits (58), Expect = 27
Identities = 11/16 (68%), Positives = 11/16 (68%), Gaps = 3/16 (18%)

Query: 3   QGPAGSGWEEGSGSPP 18
           Q PAGS    GSGSPP
Sbjct: 160 QAPAGSS---GSGSPP 172



>gi|16041188|dbj|BAB69761.1| hypothetical protein [Macaca fascicularis]
Length = 135

Score = 27.8 bits (58), Expect = 27
Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 9/28 (32%)

Query: 4  GPAG-SGWEEGSGSP--------PGVTP 22
          GP G SGWE G  SP        P VTP
Sbjct: 83 GPVGASGWEAGCASPQPTSLTPYPRVTP 110



>gi|34899222|ref|NP_910957.1| alcohol dehydrogenase-like protein-contains EST C7
[Oryza sativa (japonica cultivar-group)]
Length = 462

Score = 27.8 bits (58), Expect = 27
Identities = 12/19 (63%), Positives = 12/19 (63%), Gaps = 2/19 (10%)

Query: 5   PAGSGWEEGSGSPPGVTPL 23
           PAGSG  E  GSPP V  L
Sbjct: 405 PAGSGRGE--GSPPAVKSL 421



>gi|48845752|ref|ZP_00300025.1| COG3209: Rhs family protein [Geobacter metallire
Length = 2294

Score = 27.4 bits (57), Expect = 36
Identities = 12/19 (63%), Positives = 13/19 (68%), Gaps = 2/19 (10%)

Query: 4   GPAGSGWEEGSGSPPGVTP 22
           GPAG GW E  G+P  VTP
Sbjct: 214 GPAGPGWLEVQGTP--VTP 230

Score = 21.8 bits (44), Expect = 1638
Identities = 8/11 (72%), Positives = 9/11 (81%), Gaps = 1/11 (9%)

Query: 12   EGS-GSPPGVT 21
            +GS  SPPGVT
Sbjct: 1376 DGSFASPPGVT 1386



>gi|15226565|ref|NP_179743.1| protein kinase family protein [Arabidopsis thalian
gi|25387138|pir||G84601 probable protein kinase [imported] - Arabidopsis thaliana
gi|4567279|gb|AAD23692.1| putative protein kinase [Arabidopsis thaliana]
Length = 871

Score = 27.4 bits (57), Expect = 36
Identities = 8/10 (80%), Positives = 9/10 (90%)

Query: 17  PPGVTPLFSP 26
           PPGVTPL +P
Sbjct: 286 PPGVTPLIAP 295



>gi|2134685|pir||I53411 acetylcholinesterase (EC 3.1.1.7), minor splice form - h
(fragment)
gi|545225|gb|AAC60618.1| acetylcholinesterase; acetylcholine acetylhydrolase; A
sapiens]
Length = 72

Score = 27.4 bits (57), Expect = 36
Identities = 8/8 (100%), Positives = 8/8 (100%)

Query: 1 GMQGPAGS 8
         GMQGPAGS
Sbjct: 4 GMQGPAGS 11



>gi|25151456|ref|NP_741140.1| prion-like Q/N-rich domain protein PQN-25, Prion
domain protein (pqn-25) [Caenorhabditis elegans]
gi|16604137|gb|AAL27237.1| Prion-like-(q/n-rich)-domain-bearing protein protein 2
[Caenorhabditis elegans]
Length = 672

Score = 27.4 bits (57), Expect = 36
Identities = 8/9 (88%), Positives = 9/9 (100%)

Query: 17 PPGVTPLFS 25
          PPG+TPLFS
Sbjct: 51 PPGLTPLFS 59



>gi|30089741|gb|AAP20845.1| hypothetical protein [Oryza sativa (japonica cultiva
Length = 50

Score = 27.4 bits (57), Expect = 36
Identities = 8/10 (80%), Positives = 9/10 (90%)

Query: 6  AGSGWEEGSG 15
          AG+GWEEG G
Sbjct: 26 AGAGWEEGGG 35



>gi|499385|emb|CAA53511.1| collectin-43 [Bos taurus]
Length = 301

Score = 27.4 bits (57), Expect = 36
Identities = 15/26 (57%), Positives = 16/26 (61%), Gaps = 6/26 (23%)

Query: 1  GMQGPAG----SGWEEGSGSPPGVTP 22
          GM GPAG    SG  +GS  PPG TP
Sbjct: 50 GMPGPAGREGPSGR-QGSMGPPG-TP 73

>gi|50355694|ref|NP_001002237.1| collectin-43 [Bos taurus]
gi|27923966|sp|P42916|CL43_BOVIN Collectin-43 precursor (CL-43) (43 kDa collect
gi|18252109|gb|AAL61855.1| 43kDa collectin precursor [Bos taurus]
gi|18252111|gb|AAL61856.1| 43kDa collectin precursor [Bos taurus]
Length = 321

Score = 27.4 bits (57), Expect = 36
Identities = 15/26 (57%), Positives = 16/26 (61%), Gaps = 6/26 (23%)

Query: 1  GMQGPAG----SGWEEGSGSPPGVTP 22
          GM GPAG    SG  +GS  PPG TP
Sbjct: 70 GMPGPAGREGPSGR-QGSMGPPG-TP 93



>gi|1083017|pir||A53570 collectin-43 - bovine
Length = 301

Score = 27.4 bits (57), Expect = 36
Identities = 15/26 (57%), Positives = 16/26 (61%), Gaps = 6/26 (23%)

Query: 1  GMQGPAG----SGWEEGSGSPPGVTP 22
          GM GPAG    SG  +GS  PPG TP
Sbjct: 50 GMPGPAGREGPSGR-QGSMGPPG-TP 73



>gi|45546404|ref|ZP_00186489.1| COG0642: Signal transduction histidine kinase [R
xylanophilus DSM 9941]
Length = 1280

Score = 26.9 bits (56), Expect = 48
Identities = 9/13 (69%), Positives = 10/13 (76%)

Query: 5    PAGSGWEEGSGSP 17
            P+ SG EEG GSP
Sbjct: 1023 PSSSGREEGGGSP 1035



>gi|38086162|ref|XP_133316.2| similar to hypothetical protein [Mus musculus]
Length = 616

Score = 26.9 bits (56), Expect = 48
Identities = 11/20 (55%), Positives = 12/20 (60%), Gaps = 4/20 (20%)

Query: 7   GSGWEEGSG----SPPGVTP 22
           GSGWEEG G    + P V P
Sbjct: 521 GSGWEEGEGRTTSTEPSVIP 540



>gi|20808945|ref|NP_624116.1| hypothetical protein [Thermoanaerobacter tengconge
gi|20517608|gb|AAM25720.1| hypothetical protein TTE2596 [Thermoanaerobacter tengc
MB4]
Length = 68